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What is claimed is: 

A computer-implemented method of identifying key images in a 
document, comprising: 

extracting one or more document keywords from the document 
)nsidered important in describing the document; 

collecting one or more images associated with the document 
including information describing each image; 

generating a proximity factor for each image collected from the 
document and each document keyword that reflects the degree of 
10 correla^jon between the image and the document keyword; and 

stermining the importance of each image according to an image 
metric that combines the proximity factors for each document keyword 
and image pair. 

2. The method of claim 1 further comprising presenting the images within 
15 the docmment determined to be important on a display device. 

3. The method of maim 1 further comprising: 

ordering the document keywords according to an ordering 
criterion; an^ 

weighting the proximity factor associated with each 
20 document keyword and image pair based on the order of the 

document keyword. 

4. The method of claim 3 wherein the frequency that each document 

keyword appears in the document determines the ordering criterion 
used to order the document keywords. 
25 5. The method of claim 3 wherein Nhe ordering criterion orders the document 

keywords according to theif relationship with the subject matter of 
the document. 

6. The method of claim 1 wherein generating a proximity factor further 
comprises identifying image textVsed to describe each image. 
30 7. The method of claim 6 wherein identifying^the image text further 

comprises: 
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^ scanning a bit-mapped representation of the image for text 
information; and 

\converting the bit-mapped representation of the text 
information into image text. 
5 8. The method of claim 6 wherein the images include metadata having text 

information and identifying the image text further comprises: 

searching the metadata information associated with the 
image for text describing the image. 

9. The method of clairm 8 wherein the metadata information is compatible 
1 0 with hypertext markup language (HTML). 

1 0. The method of claim f^wherein generating the proximity factor further 

comprises lexically analyzing the image text associated with each 
image and each document keyword to determine the degree of 
correlation between an image and a document keyword. 

15 11. The method of claim 6 wherein generating the proximity factor further 

comprises performing a phonetic comparison between the image 
text associated with each image and each document keyword to 
determine the degree of correlation between an image and a 
document keyword. ^ 

20 12. The method of claim 1 wherein generating the proximity factor further 

comprises: \^ 

identifying the location of the image in the document; and 
measuring the distance between the image in the document and a 
document keyword. \^ 

25 13. The method of claim 1 1 wherein generating the proximity factor further 

comprises: 

determining the correlation\between each document 
keyword and an image according to\the distance between the 
document keyword and the image. 
30 \^ A computer appa|^u/for identifying key images in a document, 

comprising^\ V 
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a proc&sor for executing instructions; 
memory having instructions when executed on the 
processor that extract one or more document keywords from the 
document considered important in describing the document, collect 
one or more in ages associated with the document including 
information describing each image, generate a proximity factor for 
each image collected from the document and each document 
keyword that reflects the degree of correlation between the image 
and the document keyword, determine the importance of each 

to an image metric that combines the proximity 
document keyword and image pair. 
4 further comprising instructions when executed 
that present the images within the document 
imp^ort^on a display device, 
further comprising instructions when executed 
document keywords according to an 



image accordin 
factors for each 

The apparatus of claim 
on the processoi 
determined to b< 

The apparatus of claim 
on the processor 



ordering criterion 
each document 



yworc 



sight the proximity factor associated with 
and image pair based on the order of the 



document keyword. 



The apparatus of claim 16 'further comprising instructions when executed 
on the processor that use the frequency that each document 
keyword appears in the document to determine the ordering 
criterion used to order the document keywords. 
18. The apparatus of claim 16 further comprising instructions when executed 
on the processor that uses an ordering criterion that orders the 
document keywords according to their relationship with the subject 
matter of the document*. 
The apparatus of claim 14 wherein the proximity factor is generated by 

instructions executed on the processor that identify image text used 
to describe each image. 
The apparatus of claim 19 further comprising instructions when executed 
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An'apparatus for identifying key images in a document, comprising: 



on the processor fliat scan a bit-mapped representation of the image 
for text informatk n and convert the bit-mapped representation of 
the text informatic n into image text. 
The apparatus of claim 19 wherein the images include metadata having 
text information 2nd identifying the image text further comprises 
instructions wher executed on the processor that search metadata 
information associated with the image for text describing the 
image. 

The apparatus of claim 2 1 wherein the metadata information is compatible 

with hypertext markup language (HTML). 
The apparatus of claim 19 wherein generating the proximity factor further 
comprises instruc tions when executed on the processor that 
lexically analyze the imkge text associated with each image and 
each document keyword xb determine the degree of correlation 



between an image anc 
The apparatus of claim 19 wl j 
comprises instructions 



25. The apparatus of claim 19 



K a document keyword. 

irein geja^ating the proximity factor further 
€n executed on the processor that 
perform a phonetic-comparison between the image text associated 
with each image and each document keyword to determine the 
degree of correlation between an image and a document keyword. 

wherein generating the proximity factor further 
comprises instructions when executed on the processor that 
identify the location of the image in the document and 
measure the distance between the image in the document and a 
document keyword] 

26. The apparatus of claim 25 vijherein generating the proximity factor further 
comprises instructions when executed on the processor that 
determine the relevance of each document keyword to an image 
according to the distance between a document keyword and the 



image. 
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pans for extracting one or more document keywords from 
the document considered important in describing the document; 

meams for collecting one or more images associated with 
the document including information describing each image; 
5 means for generating a proximity factor for each image 

collected from the document and each document keyword that 
reflects the degree of correlation between the image and the 
document keyword; and 

means for determining the importance of each image 
10 according to an image metric that combines the proximity factors 

for each document\keyword and image pair. 
28. The apparatus of claim 27 \ further comprising: 

means for omering the document keywords according to an 
ordering criterion; and 

1 5 means for weighting the proximity factor associated with 

each document keywqrd and image pair based on the order of the 
document keyword. 
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